Parallel Numerical Algorithms: an Introduction
نویسنده
چکیده
The rapidly evolving field of parallel computing has seen four eras, substantially overlapping, yet progressive. The first, which we could label the Stone Age, was characterized by having speed-up as its primary objective (" Solve my problem faster... "). The tyranny of Amdahl's Law, combined with architectural convergence to systems that added memory in fixed proportion to processors, led to the Bronze Age, characterized by pursuit of scaled speed-up (" Solve my problem bigger, in constant time... "). The cost associated with special purpose high-performance hardware and the even greater cost of developing reliable special purpose system-level software led to the Iron Age of commodity processors — clusters of full-function computers connected by either a special purpose network or a commodity network like Ethernet or FDDI (" Solve my problem cheaper... "). Though technology and market forces have not been eclipsed in their influence on parallel tool-building and parallel applications, we can now point to a growing influence of algorithmic development and software environment development on the envelope of computabil-ity (" Solve my problem smarter... "). Perhaps, through continued progress in architecture-adaptive algorithmic design that improves upon " best fit " flop-for-flop algorithmic porting, we will soon be able to refer to a Golden Age of parallel computing. The goals of the ICASE program in parallel numerical algorithms are to accelerate, widen, and enhance the use of high performance parallel scientific computing within NASA computational applications. The first goal refers to hastening the transition to parallel
منابع مشابه
Hybrid artificial immune system and simulated annealing algorithms for solving hybrid JIT flow shop with parallel batches and machine eligibility
This research deals with a hybrid flow shop scheduling problem with parallel batching, machine eligibility, unrelated parallel machine, and different release dates to minimize the sum of the total weighted earliness and tardiness (ET) penalties. In parallel batching situation, it is supposed that number of machine in some stages are able to perform a certain number of jobs simultaneously. First...
متن کاملA New Parallel Matrix Multiplication Method Adapted on Fibonacci Hypercube Structure
The objective of this study was to develop a new optimal parallel algorithm for matrix multiplication which could run on a Fibonacci Hypercube structure. Most of the popular algorithms for parallel matrix multiplication can not run on Fibonacci Hypercube structure, therefore giving a method that can be run on all structures especially Fibonacci Hypercube structure is necessary for parallel matr...
متن کاملModeling and scheduling no-idle hybrid flow shop problems
Although several papers have studied no-idle scheduling problems, they all focus on flow shops, assuming one processor at each working stage. But, companies commonly extend to hybrid flow shops by duplicating machines in parallel in stages. This paper considers the problem of scheduling no-idle hybrid flow shops. A mixed integer linear programming model is first developed to mathematically form...
متن کاملTwo meta-heuristic algorithms for parallel machines scheduling problem with past-sequence-dependent setup times and effects of deterioration and learning
This paper considers identical parallel machines scheduling problem with past-sequence-dependent setup times, deteriorating jobs and learning effects, in which the actual processing time of a job on each machine is given as a function of the processing times of the jobs already processed and its scheduled position on the corresponding machine. In addition, the setup time of a job on each machin...
متن کاملA BROADCAST CUBE-BASED Ib'IULTIPROCESSOR ARCHITECTURE FOR SOLVING PARTIAL DIFFERENTIAL EQUATIONS
A large number of mathematical models in engineering and physical sciences employ Partial Differential Equations (PDEs). The sheer number of operations required in numerically integrating PDEs in these applications has motivated the search for faster methods of computing. The conventional uniprocessor computers are often unable to fulfill the performance requirements for these computation inten...
متن کاملParallel Implementation of Particle Swarm Optimization Variants Using Graphics Processing Unit Platform
There are different variants of Particle Swarm Optimization (PSO) algorithm such as Adaptive Particle Swarm Optimization (APSO) and Particle Swarm Optimization with an Aging Leader and Challengers (ALC-PSO). These algorithms improve the performance of PSO in terms of finding the best solution and accelerating the convergence speed. However, these algorithms are computationally intensive. The go...
متن کامل